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Abstract — A key challenge in wireless networking is the man- 
agement of interference between transmissions. Identifying which 
transmitters interfere with each other is a crucial first step. 
Complicating the task is the fact that the topology of wireless 
networks changes with time, and so identification may need to 
be performed on a regular basis. Injecting probing traffic to 
assess interference can lead to unacceptable overhead, and so 
this paper focuses on interference estimation based on passive 
traffic monitoring. We concentrate on networks that use the 
CSMA/CA (Carrier Sense Multiple Access/Collision Avoidance) 
protocol, although our model is more general. 

We cast the task of estimating the interference environment 
as a graph learning problem. Nodes represent transmitters and 
edges represent the presence of interference between pairs of 
transmitters. We passively observe network traffic transmission 
patterns and collect information on transmission successes and 
failures. We establish bounds on the number of observations (each 
a snapshot of a network traffic pattern) required to identify the 
interference graph reliably with high probability. 

Our main results are scaling laws telling us how the number 
of observations must grow in terms of the total number of 
nodes n in the network and the maximum number of interfering 
transmitters d per node (maximum node degree). The effects 
of hidden terminal interference (i.e., interference not detectable 
via carrier sensing) on the observation requirements are also 
quantified. We show that it is necessary and sufficient that the 
observation period grows like d 2 log n, and we propose a practical 
algorithm that reliably identifies the graph from this length of 
observation. The observation requirements scale quite mildly 
with network size, and networks with sparse interference (small 
d) can be identified more rapidly. Computational experiments 
based on a realistic simulations of the traffic and protocol lend 
additional support to these conclusions. 

Index Terms — Interference graph learning, CSMA/CA proto- 
col, minimax lower bounds 

I. Introduction 

Due to the broadcast nature of wireless communications, 
simultaneous transmissions in the same frequency band and 
time slot may interfere with each other, thereby limiting 
system throughput. Interference estimation is thus an essential 
part of wireless network operation. Knowledge of interference 
among nodes is an important input in many wireless network 
configuration tasks, such as channel assignment, transmit 
power control, and scheduling. A number of recent efforts 
have made significant progress towards the goal of real- 
time identification of the network interference environment. 
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Some of the recent approaches (e.g., Interference maps Q 
and Micro-probing (3)) inject traffic into the network to 
infer occurrences of interference. While such approaches can 
be quite accurate in determining interference, the overhead 
of making such "active" measurements in a large network 
limits their practicality. In addition, since network topology 
often changes over time, the periodic use of active probing 
methods to identify interference can place a significant burden 
on the network. Since the network topology changes at a 
relatively slow time scale, it is reasonable to assume that 
the interference among nodes stays constant over a period of 
time. We study an interference learning problem where the 
underlying interference pattern among the nodes stay fixed 
during the observation period. The theoretical results we obtain 
indicate the time scale over which the interference patterns can 
be identified and tracked. 

Inspired by two passive WLAN (wireless local area net- 
work) monitoring approaches Jigsaw J4], and WIT (6), a 
different approach to online interference estimation "Passive 
Interference Estimation" (PIE) is explored in 13, H). Rather 
than introducing active probe traffic, interference estimates 
are based on passively collected data. PIE estimates the 
interference in the network in an on-line fashion. All interfer- 
ence estimates are obtained by observing ongoing traffic and 
feedback information at transmitters. Experimental studies in 
small testbeds (8) show that PIE is quite promising, but very 
little is understood about how the method scales-up to large 
complex networks. That is the focus of this paper. 

The main contribution of this paper is to quantify the 
required observation time as a function of network size and 
topological connectivity. This provides insight into the time 
scale over which network interference patterns can be identi- 
fied, and tracked, in dynamically changing wireless networks. 
Our result is applicable to WLANs and other wireless commu- 
nication systems and technologies operating under CSMA/CA- 
like protocols such as, e.g., Wireless Personal Area Networks 
(WPAN) 13 and ZigBee llOl . 

We formulate passive interference estimation as a statistical 
learning problem. Given an arbitrary WLAN that consists of n 
access points (APs) and a variable number of mobile clients, 
our goal is to recover the "interference" or "conflict" graph 
among these APs with as few measurements as possible. This 
graph encodes the interference relations between APs and 
other APs' clients. We study three versions of the problem. In 
the first version we study "direct" interference between APs 
where edges in the graph indicate that a pair of APs are within 
each other's carrier sensing range. Letting d be the maximum 
number of interfering APs per AP, we show that to identify 
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the conflict graph one must collect a number of measurements 
proportional to d 2 log n. This is quite mild dependence on 
the network size n and indicates that interference graph 
inference is scalable to large networks and sparser patterns of 
interference are easier to identify than denser patterns. In the 
second version we quantify the effect of "hidden" terminal 
interference. This type of interference occurs when one AP 
interferes with another AP's clients, but the transmission of the 
interfering AP is not detectable by the other AP. In this case 
feedback on transmission successes and failures is required to 
estimate the graph. In the first two versions of the problem 
we assume the interference environment is static, i.e., time- 
invariant. In the third version we study both the direct and 
hidden interference problem when the channel state between 
pairs of devices can fluctuate randomly. This generalization 
captures effects such as deep fades (and poor channel state 
information) that can cause uncertainty as to whether a trans- 
mission failure is due to interference some other cause. For 
all versions of the problem we present easy-to-implement 
graph estimation algorithms. The algorithms are adaptive to 
d, in the sense that they do not require apriori knowledge of 

d. For the first two (static) versions we also develop lower 
bounds that demonstrate that the time-complexity attained by 
the algorithms cannot be improved upon by any other scheme. 

We adopt the following set of notations. We use upper- 
case, e.g., X, and bold-face upper case, e.g., X, to denote 
random variables and vectors, respectively. A vector without 
subscript consists of n elements, each corresponds to an AP, 

e. g., X := (Xi, X2, ■ ■ ■ , X n ). Sets and events are denoted 
with calligraphic font (e.g., £). The cardinality of a finite set 
V is denoted as |V|. A vector with a set as its subscript consists 
of the elements corresponding to the transmitters in the set, 
e.g., Qc := {Qcjcec- 

The paper is organized as follows. In Section [II] we formu- 
late the WLAN interference identification problem as a graph 
learning problem. We review the CSMA/CA protocol and 
propose a statistical model for a network using this protocol. 
In Section |III] we present our main results for the static setting 
in the form of matching upper and lower bounds (up to 
constant factors) on the observation requirements for reliable 
estimation of both direct and hidden interference graph. In 
Section IIVI we generalize the statistical model to account 
for random fluctuations in the interference environment and 
present achievable results for "robust" graph inference. We 
present an experimental study in Section [V] that supports the 
theoretical analysis. The experiments are based on simulations 
of the traffic and protocol that incorporate more real-world 
effects than the model used to develop the theory. However, in 
the experiments we do observe the scaling behavior predicted 
by the theory. Concluding remarks are made in Section |VI] 
Most proofs are deferred to the appendices. 

II. Problem Formulation 

In this section we present the problem setting. In Sec- 
tion IH-AI we present the important characteristics of the 
multiple-access protocol. In Section Hl-Bl we model the inter- 
ference environment using a graph. In Section Hl-CI we present 



the estimation problem and sketch our algorithms. Finally, in 
Section Hl-DI we present the statistical model and assumptions 
that underlie our analysis. 

A. CSMA/CA protocol and ACK/NACK mechanism 

We assume the system uses a CSMA/CA-like protocol at 
the medium access control layer, e.g., ifTTI . The important 
characteristics of the protocol are the following. When the 
transmitter has a packet to send, it listens to the desired 
channel. If the channel is idle, it sends the packet. If the 
channel is busy, i.e., there exists an active transmitter within 
the listener's carrier sensing range, the transmitter holds its 
transmission until the current transmission ends. At the conclu- 
sion of that transmission the following back-off mechanism is 
invoked. The transmitter randomly chooses an integer number, 
r, uniformly in the range [0, W— 1], and waits for r time slots. 
The positive integer W represents the back-off window size. If 
the channel is idle at the end of the node's back-off period, the 
node transmits its packet; otherwise it repeats the waiting and 
back-off process until it find a free channel. Statistically, the 
random back-off mechanism allows every node equal access 
to the medium. 

However, even if two APs are not within each other's carrier 
sensing range, the transmission from one AP may still corrupt 
the signal received at clients of the other. This is the so- 
called "hidden terminal" problem. To identify this type of 
interference further information is needed. We assume that 
an ACK/NACK mechanism is used. Specifically, we assume 
that whenever an AP successfully delivers a packet to its 
destination, an ACK is fed back to acknowledge the successful 
transmission. If the AP does not receive the ACK after a period 
of time, it assumes that the packet is lost. The ACK/NACK 
mechanism enables the APs to detect collisions however they 
occur. 

B. Interference Graph 

We use a graph G = (V, £) to represent the interference 
among APs in the network. The node set V represents the APs, 
and edge set £ represents the pairwise interference level among 
APs. An example of such of graph is depicted in Figure Q] 

We partition £ into two subsets: direct interference £d and 
hidden interference £h- Direct interference occurs when two 
APs are within each other's carrier sensing range. An example 
in Figure [T] is the AP pair (4,5). Under the assumption that 
the carrier sensing range is the same for every AP, the edges 
in £d are undirected. However, as mentioned above, carrier 
sensing cannot resolve all of the collisions in the network. 
Hidden-terminal type interference is represented by the edges 
in £h ■ Such interference may be asymmetric and so the edges 
in £h are directed. In Figure Q] the AP pairs (1,2) and (3,4) 
cannot detect each other. Yet, they can interfere with each 
other's clients since their carrier sensing ranges intersect. The 
effected clients would (roughly) lie in the intersection of the 
(roughly circular) carrier sensing ranges of the AP pair. Nodes 
3 and 5 are are an example of an AP pair that does not interfere 
(either directly or indirectly) since their carrier sensing ranges 
do not overlap. 
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Fig. 1. System model with 5 APs. The edges connecting APs represent the 
interference between them. Solid line represents direct interference, dashed 
lines represent hidden interference. Each circle represents the carrier sensing 
range of the AP at its center. Clients that are located in the intersection of 
the carrier sensing regions of a pair of APs may be subject to interference. 

One can further associate a probability or weight with each 
interference edge. This weight reflects the level (severity or 
persistence) of the interference. Our main interest is to iden- 
tify the presence of interference. Once this is accomplished, 
estimating the severity of interference is straightforward. Thus, 
our emphasis is on determining the presence of edges and 
not their weights. The sparsity of the interference graph is 
parameterized by d, the maximum node degree. This parameter 
is a bound on the number of other nodes that interfere with 
any particular node. 

C. Inference Problem and Algorithm 

Our objective is to determine the edge set £ based on passive 
observations. We assume the existence of a central controller 
that, over a period of time, collects the transmission status of 
all APs and information on transmission successes (ACKs). 
The collected information during this observation period is 
the dataset we use to estimate the interference graph. Both 
enterprise-type WLANs and proposed architectures for 4G 
systems that involve combinations of macro plus pico-cells 
have such central controllers. 

In this section and in Section [Til] we consider static channel 
states. Specifically, we assume that whether or not a pair of 
APs can sense each other is fixed (deterministic) throughout 
the period of observation. For hidden interference, we also 
assume that the channel between each AP and all clients 
of other APs is also static. Further, if a transmission failure 
occurs it must have been caused by a collision with a packet 
transmitted by one of the other active APs. Thus, for now 
we ignore other factors that contribute to transmission failures 
such as deep fades. In Sec. [IVJ we relax this assumption 
and take the randomness of the communication channels into 
consideration. We defer till then the formal definition of the 
system model with random channel states. 

We now outline the basic ideas underlying our algorithms. 
First, consider the direct interference environment, character- 
ized by £d- Note that due to the use of carrier sensing in 



CSMA/CA, if two APs are within each other's carrier sensing 
range, they will not transmit simultaneously. One can there- 
fore infer that any pair of APs that transmit simultaneously 
during the observation period must not be able to hear each 
other. In other words, there is no direct interference between 
them. The algorithm starts with a full set of OY ') candidate 
edges. Each time a simultaneous transmission is observed, the 
corresponding edge is removed. If the observation period is 
sufficiently long, all possible co-occurring transmission will 
be observed and the correct direct interference edge set Ed 
will be recovered. 

Next, consider the cases of hidden interference characterized 
by £h- Estimating this set is more involved and requires the 
collected ACK information. When a collision is detected at an 
AP, it implies that at least one of the other APs transmitting at 
the same time is interfering. The subset of APs transmitting at 
that time instance becomes a candidate set of hidden interferes 
for that AP. For each collision detected by the AP, another 
candidate subset is formed. The true set of hidden interferers 
for that AP must intersect with all the candidate sets. When 
the observation period is sufficiently long, we show that 
the minimum hitting set ifTZl that intersects with all of the 
candidate subsets is the set of hidden interferers. The edge set 
£ h can thus be recovered. 

Our approaches to both problems were inspired by the PIE 
algorithms proposed in Q, 0. For the direct interference 
problem, our approach is similar to PIE. Both rely on si- 
multaneous transmissions to infer the interference graph. Our 
results provide the theoretical analysis and scaling behavior 
characterization that complement the empirical results of Q, 
[81 ■ Our approach to the hidden interferences problem is quite 
different from Q, (8). While PIE uses a correlation based 
approach we use the hitting set approach mentioned above 
which attains more accurate results. 

D. Statistical Model with Static Channel States 

The graph Gd = (V,£d) represents the carrier sensing 
relationships among the APs. Specifically, if AP i and AP j 
are within each other's carrier sensing range, there is an edge 
between i,j, denoted as The existence of such an edge 

implies that AP i and AP j are close and the transmission of 
each can be observed by the other. An example in Figure Q] 
is the AP pair (4,5). We term this "direct" interference and 
Gd the direct interference graph. We define Mi to be the set 
of neighbors of AP i in Gd = (V,£d), and let di = 

Observations of the network activation status are taken at 
time epochs. We use Xj(t) 6 {0, 1} to denote the activation 
state of node i at time t: Xi(t) = 1 means that node i 
transmits; Xi{t) = means that node i does not transmit. In 
general, Xi(t) is determined by the traffic status and backoff 
times of the APs that compete for the same channel. 

As mentioned, to infer the hidden interference we required 
information on transmission successes and failures. Define 
Yi(t) <S {0, 1, 0} to be the feedback information information 
received at AP i at the end of session t. Yj (t) = 1 means that 
an ACK is received at AP i, indicating that the transmission in 
session t is successful; Yi(t) = means that the transmission 
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failed, caused by some simultaneous transmissions; Yi(t) = 
means that node i does not transmit in that session, i.e., 
Xi(t)=0. 

We let a graph Gh = (V,£h) represent the hidden 
interference among APs that cannot hear each other. This 
interference depends on the locations of the clients associated 
with each AP and thus may not be symmetric. These edges 
are therefore directed. We define 

Vij = F(Y 3 (t) = 0\Xi{t) = X,{t) = l,X VX{iii} (t) = 0) 

(1) 

i.e., pij is the probability that, when i,j are isolated from the 
rest of the APs and -Xj(t) = X (t) = 1, AP i interferes with 
AP j, causing transmission failure of AP j. 

The capture the randomness in locations of the clients 
associated with each AP. An AP may interfere with only a 
subset of the clients of a neighboring AP. For example, in 
Fig.[TjAP 1 may interfere with a client of AP 2 that is halfway 
between the APs, but it likely will not interfere with a client 
on the far side of AP 2. Thus, which clients one AP interferes 
with depends on the location of the other AP's clients. The 
value of represents the proportion of AP j's clients that AP 
i interferes with. It can be interpreted as the probability that AP 
j communicates with a client located in the overlapped area of 
the carrier sensing ranges of APs i and j. We don't define p^ 
for £ £d since such pairs of APs are within each other's 
carrier sensing range and thus never transmit simultaneously. 
This is the case for AP 4 and AP 5 in Fig. [TJ 

Define Sj = £ £h i £ V} to be the hidden 

interferer set for AP j, i.e., the set of APs with pij > 0. 
We let Sj = \Sj\. We point out that in general ViS C V\{i,j}, 

Pij < W(Yj(t) = 0\Xi(t) = l,Xj(t) = 1,X S = 0), (2) 

i.e., p^ is less than the probability that a collision occurs at 
AP j when both APs i and j are transmitting. This is because 
the collision at AP j may be caused by an active AP other 
than i, and AP i may just happen to be transmitting at the 
same time. 

The complete interference graph G = (V,£) consists of 
both direct interference graph and hidden interference graph, 
i.e., £ = £o U £h ■ We note that £d n £h = 0- We now state 
some statistical assumptions important in the development of 
our analytic results. 

Assumption 1 

(0) APs are synchronized and the time axis is partitioned 
into synchronized sessions where each session consists 
of a contention period followed by a transmission cycle. 
We use t — 1,2, ... to denote the indices of the sessions. 

(1) The traffic status Qi(t) are Ltd. Bernoulli random 
variables with common parameter p, where < p < 1, 
for all i £ V and all t £ Z + . In other words, the 
Qi(t) are independent across transmitters and sessions: 
P(Qj(t) = 1) = p for all i and t. APs competes for 
the channel at the beginning of session t if and only if 

QM 

(ii) The backoff time Ti{t) are continuous i.i.d. random 
variables uniformly distributed over [0, W] and are 



statistically independent for all i £ V and all t £ Z + . 
W is a fixed positive integer. 

(iii) For all £ £h, there exists a constant p m in, < 

Pmin ^ 1> ^- 1. Pij ^ Pvain- 

(iv) For all i £ V, there exists an integer d > 1, s.t. di < 
d-1. 

(v) For all j £ V, there exists s £ Z + , s.t. Sj < s. 

Assumption [T}(0) is essential for our analysis. In practical 
WLANs, APs competing for the same channel are "locally" 
synchronized because of the CSMA/CA protocol. Assump- 
tion QKO) approximates the original "locally" synchronous 
system as a synchronized one. Since APs far apart on the 
interference graph have a relatively light influence on each 
other's activation status, this assumption provides a good 
approximation of the original system and greatly simplifies 
our analysis. Assumption [D(i) ignores the time dependency 
and coupling effect of the traffic queue status of APs. It 
is a good approximation of traffic for a system operating 
in light traffic regime. Assumption [TJ-(ii) that the Ti(t)s are 
continuous rather than discrete quantities (as they are in 
CSMA/CA protocol), implies that with probability one no 
two adjacent nodes in Gd = (V, £d) will have the same 
back-off time. This is a valid assumption when W is a large 
integer. The first two assumptions guarantee that the joint 
distribution of Xi(t)s and Yi(t)s is independent and identical 
across t. Therefore, in the analysis hereafter we ignore the 
time index and focus on the distribution of XiS and YiS in 
one observation. Assumption [D(iii) defines the lower bound 
of significant interference. The final assumptions, (iv) and 
(v), model the fact that the interference graph will be sparse 
because of the widespread spatial distributions natural to the 
large-scale wireless networks of interest. 

III. Main Results for Static Channels 

In this section we present our main results for static chan- 
nels. We break the overall problem into four subproblems. 
In Section IIII-AI we provide an achievable upper bound on 
the number of observations required to infer the directed 
interference edge set £o- In Section IIII-BI we present a 
matching lower bound on the number of observations required 
to infer £]j. In Section UlI-CI we provide an achievable upper 
bound on the number of observations required to infer the 
hidden interference edge set £h- Finally, in Section UlI-DI we 
present a lower bound on the number of observations required 
to infer £h- 

A. An Upper Bound for Inference of Gd = (V,£d) 

In this section, we formalize the algorithm sketched in 
Section lll-Cl for estimating Gd = (V,£d) and present analysis 
results. In the static setting, for any £ £d, APs i 

and j can always sense each other's transmissions. For any 
AP pair $ £d, the APs can never detect each other's 

transmissions. 

Say that a sequence of transmission patterns X(l), X(2), 
. . ., X(fc) is observed. Due to the use of the CSMA/CA 
protocol and the continuous back-off time of Assumption [TJ- 
(ii), any pair of APs active in the same slot must not be able 
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to hear each other. Thus, there is no edge in Gd = (V,£d)- 
In other words, given an observation X, for any i,j with 
Xi = Xj = 1, we know that ^ So- 
Based on this observation, the algorithm starts at t = 1 with 
a fully connected graph connecting the n APs with f^') = 
n(n — l)/2 edges. For each transmission pattern X observed, 
we remove all edges s.t. Xi = Xj = 1. Our first result 
quantifies the number of observations k required to eliminate, 
with high probability, all edges not in Ed, thereby recovering 
the underlying interference graph G/j. In AppendixlAlwe show 
the following result: 

Theorem 1 Let 5 > 0, and let 

fc - -log(l-^) ( l0S (2) +1 °4)- 
Then, with probability at least 1 — 5, the estimated interference 
graph Gd = (V,£d) is equal to Gd after k observations. 

The idea of the proof is first to lower bound the probability 
that two nonadjacent APs i,j never transmit simultaneously 
in k observations. Then, by taking a union bound, an upper 
bound on the required k is obtained. 

Remark: In the theorem the upper bound k = 0(d 2 logn) 
when p 2 /d 2 <C 1. This is a natural regime where wireless 
networks become dense and the carrier sensing threshold is 
fixed. This order is the best we hope for in terms of p and 
d through passive observations. This is because if two non- 
interfering APs never transmit simultaneously, their behavior 
is the same as if they were within each other's carrier sensing 
range. Thus, we cannot determine whether or not there is an 
edge between them. Since each transmitter competes with its 
neighbors, the probability that it gets the channel has the order 
of 1/d. So, roughly speaking, it takes about d 2 snapshots to 
observe two non-interfering APs active at the same time. Since 
there are about n 2 such pairs in the network, a union bound 
gives us the factor log n. 

B. A Minimax Lower Bound for Inference of Gd = O^t^d) 

We now provide a minimax lower bound on the number of 
observations needed to recover the direct interference graph 
Gd = (V,£d)- Denoting the estimated graph as Gd = 
(V,£d) we prove the following result in Appendix iBl 

Theorem 2Ifn>3, d-l< (3n - \/n 2 + 16n)/4, and 

a{d-l) 2 , 
k < rlogn, 

( 2 + ^) 
then, for any a, < a < 1/8, 

minmaxP(G I5 ^ G D \ G D ) > — (l-2a- \f^-] ■ 

The approach to this result is the following. We construct 
a set of M maximum-degree d graphs. We construct the set 
so that the graphs in the set are very similar to each other. 
This will make it hard to distinguish between them. For each 
graph in the set the statistical assumptions of Section IH-DI 



induce a distribution on the observed transmission patterns. 
Given k observed transmission patterns we consider the M- 
ary hypothesis test for the underlying graph. Since the size 
of the candidate graph set for the original estimation problem 
is much greater than M, this test is easier than the original 
problem. Therefore, a lower bound for this test will also 
lower bound the original estimation problem. Since for each 
graph we know the distribution of patterns, we can lower 
bound the probability of error for this hypothesis test using 
the Kullback-Leibler divergence between each pair of induced 
distributions. We prove that, for the collection of graphs we 
construct, the number of observations required to detect the 
correct underlying graph has the order of fl(d 2 logn). 

Remark: We note that the order of the lower bound, 
fl(d 2 logn), has the same order as the upper bound in The- 
orem Q] when p 2 /d 2 <C 1. Therefore, the estimation method 
based on pairwise comparison is asymptotically optimal when 
d —> +00. 

C. An Upper Bound for Inference of Gh = (V, £/r) 

We now present result on inferring the hidden interference 
graph Gh = (V,£h) through observations of X(l), . . . X(fc) 
and Y(l), . . . Y(fc). When the transmission of AP j fails 
(indicated by the feedback Yj = 0) the failure must have 
been caused by collision with a transmission from one of 
the active APs in Sj. However, as there may be multiple 
hidden interferers, there may be no single AP that is always 
transmitting when Yj = 0. Complicating the situation is the 
fact that an AP that transmits regularly when Yj = 1 may not 
be a hidden interferer at all. This is because the transmission 
status of an AP can be highly positively correlated (because 
of CSMA/CA) with one or more hidden interferers. We first 
illustrate the types of statistical dependencies we need to 
address before presenting our algorithm. 

First consider a scenario where a triplet of APs (i,j,k) 
lie in the direct interference range of AP /, i.e., i, j, and 
k are all are in Mi. Further assume that there is no direct 
interference between i, j, and k. The transmission of any of 
these three APs will suppress the transmission of AP I and 
thus increase the (conditional) probability of transmission of 
the other APs in A/j. Thus, the activation statuses of i,j,k 
are positively correlated. Now say that i,j are both hidden 
interferers for AP m. Then, even though transmissions of AP 
k may be correlated with transmission failures of AP m, due 
to the positive correlation statuses of i,j,k, AP k may not 
be a hidden interferer for node m. One possible scenario is 
that F(Y m = 0\Xk = l,X m = 1) may be even greater than 
P(y m = 0\X t = l,X m = 1) or ¥(Y m = 0|Xj = l,X m = 1). 
The upshot is that correlation-based approaches to determining 
hidden interferers, such those adopted in Q, ||8), ma Y not 
accurately distinguish true interferers from non-interferers. To 
address these issues we propose the following approach, based 
on minimum hitting sets. 

First, given k observations, define 

K j {k) = {t& {i,2,..., k} 1 *;•(*) = 0} 

to be the sessions in which AP j's transmissions fail. For each 
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t G tCj (t) define the set of candidate hidden interferers as 

S t j ={ieV\i^j,X i (t) = l} 

and let 

Sj(k) = argmin{|5| | S n 5j ^ 0,V t G /C,(fc)}, 

where, if there are multiple minimizers, one is selected at ran- 
dom. The set Sj(k) is a minimum hitting set of the candidate 
interferer sets {<Sj}tex:,,(fc) P er the following definition: 

Definition 1 (Minimum Hitting Set) Given a collection of 
subsets of some alphabet, a set which intersects all subsets in 
the collection in at least one element is called a "hitting set". 
A "minimum " hitting set is a hitting set of the smallest size. 

In Appendix ICl we prove the following lemma. 

Lemma 1 lim fc _ i . 00 P(S 3 (k) = Sj) = 1. 

Lemma [T] implies that when k is sufficiently large, identify- 
ing the minimum hitting set of the candidate interferer sets is 
equivalent to identifying the hidden interferer set of an AP. In 
the following, we drop the argument k in Sj (k) for conciseness 
of notation and without fear of ambiguity. 

Given k observations, our algorithm determines the mini- 
mum hitting set Sj of {<Sj | t G Kj(k)} for each j G V. By 
Lemma[T]we recover Gh = (V, £h) correctly, with high prob- 
ability, for k sufficiently large. The following theorem provides 
an upper bound on the number of observations required so that 
Sj = Sj for every AP j € V with high probability. Once the 
estimated minimum hitting set Sj is obtained, the estimated 
hidden interference graph Gh is constructed by adding a 
directed edge from each AP in Sj to AP j, i.e., 

£h= (J \ieSj}. 

jeV 

The following theorem is proved in Appendix [D] 

Theorem 3 Let 5 > 0, and let 

k > t \- — r- ( log(ns) + log i J 

-log(l- p ' (1 -g p -" ) V SJ 

Then, with probability at least 1 — 5, Gh equals Gh- 

The approach taken in the proof can be summarized as 
follows. For every AP j, we first upper bound the probability 
that the minimum hitting set obtained after k observations is 
not equal to the true minimum hitting set. This is equal to the 
probability that there exists at least one AP i G Sj that is not 
included in Sj. By taking the union bound across all possible 
i and j, we obtain an upper bound on k. 

1) Finding the minimum hitting set: In general, finding 
the minimum hitting set is NP-hard lfl2l . However, under the 
assumption that s <C n, the minimum hitting set can be solved 
for in polynomial time. This is a regime appropriate to the 
large-scale wireless networks of interest in this paper. First 
we use the algorithm of Section UlI-AI to identify Go- Next, 
each AP in turn. For AP j test every subset of nonadjacent 



APs in Gfl (and not including AP j) to determine whether 
it is a hitting set of {Sj \ t G /Cj(fc)}. Since the number of 
hidden interferers Sj < s, we start with the smallest possible 
hitting sets, i.e., Sj = 1. We increment the size of the testing 
subset by one, until a hitting set is achieved. In this way we 
find the minimum hitting set for the given {Sj \ t G K,j(k)}. 

The worst situation for this incremental approach will be 
when Sj is as large as possible. By Assumption [T}(v) Sj < s. 
Recalling that dj — 1 is the number of direct interferers (which 
can be eliminated from consideration), in this case the number 
of subsets we test is at most 

E ("";*) -*(»-*)■ 

for some bounded constant In other words, the number of 
subsets we need to test is upper bounded by 0(n s ) |[T3l . 

2) A special scenario: s = 1: In certain situations hidden 
interferers are rare. For example, they are rare when the 
threshold for carrier sensing is set low which makes carrier 
sensing range large. It can be reasonable to assume in such 
settings that there is at most one hidden interferer for any AP, 
i.e., s = 1. In this subsection we show that in this case, the 
minimal hitting set algorithm presented above specializes to a 
correlation-based method. 

Specifically, assume that the single interferer in Gh for AP 
j is AP i. Then, whenever t G ICj(k), Xi must equal 1. Of 
course, transmitter i may not be the only active transmitter 
when Yj = 0. However, conditioned on the event Yj = 0, 
F(Xi = l\Yj = 0) = 1, which is strictly larger than V(X t = 
l\Yj = 0) for any other AP I ^ j. Therefore, thresholding 
P(Xi — l\Yj ■ — 0) reliably detects the true hidden interferers. 
Take k observations of X and Y. For each AP j, we count 
the number of times that an AP is active for all t G ICj(k). 
The AP with the highest count is the hidden interferer of AP 
j. This is equivalent to searching for the minimum hitting set, 
since the hidden interferer appears in every candidate interferer 
set. We can show that it is also equivalent to the correlation 
based approach in (SJ- When s > 1, thresholding the empirical 
frequency of APs being active for t G K,j (k) is different from 
the minimum hitting set method. 

The hidden interference graph learning problem is related 
to the partially labeled multi-class classification problem dis- 
cussed in 04]. In lfl4l a problem is studied wherein each 
object is given a candidate set of labels, only one of which 
is correct. In our problem, for each AP j, from the candidate 
hidden interferer set 5* we want to extract the indexes (labels) 
of the actual interferers. When s = 1, the only true hidden 
interferer is the correct "label" for AP j. In general, our 
problem differs from 0141 in that there may be more than 
one true hidden interferers (correct labels) for an AP in our 
problem. Therefore, our framework generalizes the partially 
labeled multi-class classification problem, and implies broader 
applicability of our approach. 

D. A Minimax Lower Bound for Inference of Gh = (V, £h) 

Finally, we provide a lower bound on the number of obser- 
vations required to recover the underlying hidden interference 
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graph Gh = (V, £#). In Appendix [E] we prove the following 
theorem. 

Theorem 4 If 

d < cin, s — 1 < c 2 n, 2ci + c 2 < 1 (3) 

and 

k< log(2ci(^-l)n) 

(i^£)l) 2 (l-p) S -llog(l-p min ) 
then for any a, < a < 1/8, 
min max P(G ff ^ G H :G) > 

V2 Cl (^-l)» / _ / 2a ~ \ 

The conditions of (0 ensure that the max degree d and 
bound on the number of hidden interferers s both scale at 
most linearly in n, subject to a constraint on the joint scaling. 
The approach to this result is the same as we followed for 
the proof of Theorem [2] We reduce the original problem to 
an M-ary hypothesis test and show that, asymptotically, the 
lower bound has the same order as the upper bound. 

Remarks: Since the distribution of Y depends on the 
underlying direct interference graph as well as on the hid- 
den interference graph, the lower bound is over all possible 
interference graphs G. As d increases, the lower bound in 
Theorem |4] is of order ^— i og (i- P f)(i-p) 3 ~ 1 n ) ' Since 
log(l — p m in) can be approximated as — p m i n when p m i n is 
small, the lower bound is of the same order as the upper bound 
provided in Theorem [3] Therefore, the bounds are tight and 
our algorithm is optimal. 

IV. Main Results for Random Channels 

In the earlier sections we study a setting where the channel 
states between pairs of APs and between APs and clients 
are static. However, the time-varying nature of the wireless 
medium means that these modeling assumptions may be 
optimistic. The channel state between a pair of communication 
devices is often better modeled as random. One implication is 
that the sensed signal strength at an AP - or the interference 
level at a client's receiver - is also random. We now try to 
capture this randomness in our model. We modify our earlier 
model by introducing independent random noise into the car- 
rier sensing channel between APs and into the communication 
channel between APs and clients. The inference algorithms 
must be adjusted to accommodate these additional sources of 
randomness. 

A. System Model with Random Channel States 

Because of the randomness of channel states between a pair 
of APs, two APs may not always sense each other, even though 
they are within each other's carrier sensing range. In order to 
capture this randomness, we define to be the probability 
that AP j cannot sense AP i when AP i is transmitting, e.g., 



the signal strength received from AP i is below the carrier 
sensing threshold at AP j. As before we assume reciprocity of 
channels between APs so = qji for all i, j £ V. We again 
categorize edges into direct and hidden interference sets, but 
the distinction is less hard than before. For every £ £d 
we assume 

Qij — q\ow 3 

for some gi ow > 0. This assumption reduces to that of the 
static scenario of Sec. [Ill] if we set q\ ow = 0. On the other 
hand, for £ £h = £\£d we assume that 

Qij > Qup, 

for some q up < 1; earlier we assumed that q up = 1. Further, 
< qiow < <Zup < 1 and both probabilities may scale with 
d, n. 

The definition of pij remains the same as before. Recall that 
Pij is the probability that AP i causes transmission failure of 
AP j, and > p m ; n for all (i, j) £ £h- In the static scenario, 
the mainly model the random locations of clients that 
communicate with APs. In the random channel state setting, 
the p^ can also capture randomness in channel states. In 
Section IIV-CI we use this flexibility to model transmission 
failures due to deep fades. We define p$j to be the probability 
that the transmission from AP j fails due to poor channel 
conditions. While these failures are not caused by any active 
AP, this is not necessarily known and thus we need to 
incorporate this cause of failure into the interference set iS^ . 
Thus, we consider to be a "virtual" interferer and define the 
augmented set of interferers as Sj U 0. We will assume that 
pq,j is independent of the status of all APs in the network. 

B. Direct Interference Graph Inference 

Since in the random channel state setting directly interfering 
APs may not always detect each other, we keep count of 
the number of co-occurring transmissions and require this 
number to exceed a threshold to declare existence of an edge. 
Assuming a length-fc observation interval define 

f E k t=1 HMt) = x 3 (t) = n 
lJ ~ k 

where 1{ } is the indicator function. If ly > e we declare 
that APs i and j do not interfere. Thus, to get the estimated 
graph Go we start with a fully connected graph and remove 
all edges such that > e. As we see in Theorem [5] 

the value of e depends on q up , q\ ow , as well as p, d. 

To derive our results we require the following assumption 
in addition to those of Assumption Q~| 

Assumption 2 

(i) 9up ^ c^,d 2 q\ ow , where C3 > 1 is a constant. 

Assumption [2] provides a sufficient condition to distinguish 
the edges in £0 from the non-edges. Intuitively, we expect to 
observe that a pair of APs not connected by an edge in £ £> to 
transmit simultaneously more frequently than those connected 
by edges in £r>. In the worst scenario, two APs £ 
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Ed, have d — 1 neighbors each in Gd, while another pair 
(i', j') £ £d only have each other as the neighbor. Assumption 
12 ensures that co-occurring transmissions of i,j occur more 
often than those of (i',f) even in the worst scenario. 

In Appendix [F] we prove the following achievability result: 



Theorem 5 Under Assumptions Q] and |2] if 



P 2 g up (l ~ l/c 3 ) 
2d 2 



and 



fc > _L flogn + ibgi 



f/ien, vw'f/i probability at least l — S, the estimated interference 
graph Gd = (V,£d) is equal to Gd- 

We note that the upper bound on the number of observations 
k for reliable estimation of the direct inference graph is 
0(d 4 logn). This is worse than the minimax lower bound 
in static setting. This is natural since randomness in commu- 
nication channels corrupts the observations, making it more 
difficult to distinguish true interferers from non-interferers. 

C. Hidden Interference Graph Inference 

As discussed in the section introduction, we can now model 
transmission failures due to both deep fades and interference. 
The following assumption says that the former are not too 
likely. If deep fades were more likely to cause transmission 
failures than interference, it would be extremely hard to 
distinguish such failures from other hidden interferers (and 
would really perhaps not be very useful to determine the 
interference graph at all). 

Assumption 3 

(i) We assume that p$j is upper bounded by p$ for any j 
and further that p® < p m i n . 

Besides Sj U 0, direct interferers may also collide with 
node j if they cannot sense each other and yet happen 
to transmit simultaneously. The existence of three different 
interference sources for transmission collision (deep fades, 
hidden interference, direct interference) makes the problem 
much more complicated compared to the static channel setting. 
The minimum hitting set method cannot be applied in this 
scenario. One obvious reason is that channel states cannot be 
measured, so can be used to explain every collision, i.e., is 
the minimum hitting set for every AP j. Therefore, Sj cannot 
be attained by the minimum hitting set method. 

In the static channel setting discussed in Sec. IIII-CI the 
estimated Gd is not required as an input for the hidden 
interference graph estimation. Since the channel state between 
each pair of APs is static, if the APs are neighbors in Gd 
they are guaranteed not to transmit simultaneously. This is 
not the situation in the random channel state setting since a 
pair of neighboring APs in Gd can transmit simultaneously. 
In order to isolate the effect of the direct interferers from 



that of the hidden interferers our algorithm focuses on the 
collision events of an AP when none of its direct interferers are 
active. Determining this subset of events requires knowledge 
of Gd- Thus our algorithm starts by estimating Gd using the 
algorithm of Sec. IIV-BI That estimate is then used as an input 
to estimate Ga- 
in the following we assume that the estimate Gd has already 
been made. The algorithm for inferring the hidden interference 
graph makes use of the following two indicator functions, both 
defined for any S and AP pair i,j where S C V\{i,j}. The 
first is 

I ijtS (t) = l{X s (t) = 0,X#.(t) = 0,Xi(t) = Xj{t) = 1}, 

where Afj is the estimate of Mj deduced from Gd- This 
indicator selects the sessions in which nodes i and j both 
transmit, the (estimated) neighbors of node j do not transmit, 
and nor do any nodes in the set S. The second indicator 

4s« = %sW'lft(t) = 0} 

further narrows the sessions selected by the first indicator to 
ones in which node j transmission fails. 
We define the estimate 



Pij 



mm — 7 . 

SCVV^uRj} Et=i%sO) 
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Searching for the optimal S to attain p^ is to isolate the 
effect of hidden interferers other than i, together with that of 
Mj, from that of i. Thus, if Mj = Afj (which would be the 
case if Gd = Gd), i and are the only possible sources for 
the counted collisions. 

If i £ Sj, then is an estimate of 

V(Yj = 0\Xj = l,X i= l,X SjV = 0,X#. = 0). 

If i ^ Sj, then is an estimate of 

F(Yj = 0\Xj = l,Xi = l,X Sj = 0,X#. = 0). 

The importance of this estimate results from the following 
lemma proved in Appendix iGl 

Lemma 2 For any j £ V there exists a unique set Sj such 
that, for any i £ Sj, 

r(Yj = 0\Xj = 1, X t = l, X Sj \i = 0, X M] = 0) > Pij . (4) 

Further, for any i £ V\{«Sj U Mj}, 

¥{Yj = 0\Xj = l,X l = l,X Sj = 0, X Mj = 0) = P$r (5) 

Thus, if Mj = Mj the lemma tells us that the value of p^j 
indicates whether i £ Sj. 

Say that for some pair of APs, p^ > S where 6 is 

equal to the average value of p m i n and p® . The algorithm is to 
add the edge to the estimated hidden interference graph 
Gr for all such pairs. 

In Appendix IH1 we prove the following: 
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Theorem 6 Under Assumptions [7] [2] and [3] ;/ 

P 2 gup(l - 1/C 3 ) 

" 2d 2 

c Pmin + P0 



(Prnin -£>0)(1 ~P) S (1 ~ g low ) d p 2 

(1 + r5)d 2 



ant/ 

k > max | ^ log 7i+ ^log-), + 2) log n + log -)} , 

(6) 

f/ie« f/ze estimated hidden interference graph Gh is equal to 
Gh with probability 1 — 7 after k observations. 

The first term in (|6]l guarantees that k is sufficient large to 
infer Gd reliably, and the second term ensures that pij con- 
centrates around its mean value. Asymptotically, the second 
term in the bound of k dominates the first term, and the upper 
bound on k for reliable estimation of the hidden inference 
graph is O ( (1 , p) . a( f_ giow)2d log n) . 

V. Simulation Results 

In this section we present simulation results for the fol- 
lowing model. Access points and clients are deployed over 
a rectangular area that can be partitioned into square cells 
50to on a side. In each cell an AP and several clients are 
placed uniformly at random. The locations of APs and clients 
are fixed throughout the period of observation. Each client is 
associated with the nearest AP. By generating the networks in 
this manner the traffic load across APs is relatively balanced. 

For each client in its communication range, the AP generates 
an independent traffic flow for it according to a Poisson pro- 
cess with 0.1 packet per session. In order to predict received 
signal strength between APs and clients, we employ the log- 
distance path loss model. In this model, received power (in 
dBm) at unit distance (in meters) from the transmitter is given 
by: 



r(o = r(i )-ioa]og(i//o) + x 



(7) 



In the above, r(Zn) is the signal strength at reference distance 
Iq from the transmitter, a is the path loss exponent, and X a 
represents a Gaussian random variable with zero mean and 
variance a 1 in db. We choose Iq to be 1km, er 2 to be 3db, and 
a to be 4. We also assume that the "shadowing" (represented 
by X a ) between any pair of APs and AP-client pair is fixed 
throughout the period of observation. Thus the underlying 
interference graph is constant within the observation period. 

The system operates according to IEEE 802.11 CSMA/CA 
protocol. In comparison to the statistical assumptions made 
to derive the theoretical learning bounds, this setup mimics 
wireless networks more closely. 

We first study the direct interference estimation algorithm 
of Section IIII-AI We fix the carrier sensing range for the APs 
to be 70m and simulate five networks, respectively consisting 
of 4 x 7, 4 x 8, 4 x 9 and 4 x 10 cells. For each network size, 
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Fig. 2. The observation duration to recover direct interference graph for 
networks with d = 5 and s = 2, plotted as a function of the number of cells 
in the network. 
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Fig. 3. The observation duration to recover direct interference graph for 
networks with 4x8 cells, plotted as a function of maximum degree d. 



we randomly generate 10 topologies, i.e., randomly placed 
APs and clients. For each topology, we use the algorithm to 
recover Go five times with different (randomly generated) 
traffic traces. We report the average observation duration (i.e., 
number of sessions) required to recover the direct interference 
graph for each topology. The median observation time over 
the 10 topologies is plotted in Fig. [2] The worst case is also 
plotted. 

Next, we fix the network size to be 4 x 8 cells and randomly 
generate topologies. For each topology, we vary the carrier 
sensing range to be - 70m, 80m, 90m, 100m - and randomly 
generate topologies. We check the maximum degree d (the 
number of direct interference edges per node) and s (the 
number of hidden interferers per node) for each topology. We 
select ten topologies with s = 2 for each d varying from 5 to 
8 and each carrier sensing range. We then simulate each case. 
We plot the median observation time required to recover Gd 
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Fig. 4. The observation duration to recover hidden interference graph for 
networks with d = 5, s = 2, plotted as a function of the number of cells in 
the network. 

as a function of the maximum degree of the network in Fig. [3] 
The worst case is also plotted in the same figure. 

The plots show the scaling that was predicted by the theory. 
The necessary observation duration scales sub-linearly (loga- 
rithmically) with network size n, and super-linearly (quadrat- 
ically) in d. 

Next, we study estimation of the hidden interference graph. 
In Fig.|4]we plot the observation duration required to correctly 
identify the minimum hitting set for each node and recover 
the hidden interference graph as a function of network size. 
The same simulation conditions hold as were described in the 
discussion of Fig. [2] For this algorithm we again observe 
that the necessary observation duration scales sub-linearly 
(logarithmically) with network size n. 

In Fig.|5]we examine the dependence of the necessary obser- 
vation duration on s for hidden interference graph inference. In 
these simulations we fix the network size to be 4 x 8 cells and 
the carrier sensing range to be 80m. We randomly generate 
topologies with fixed d = 5, and s varying from 1 to 4. 
The necessary observation duration as a function of s is then 
plotted. We observe that the observation duration increases as 
s increases. However, it scales in a complicated form. This is 
because the hidden interferers cause transmission collisions in 
the networks, which actually increases the traffic intensity at 
each AP. 

We also examine the robust direct interference estimation 
algorithm of Section lTV-BI Unlike the static scenario where we 
assume that the shadowing (i.e., X a ) between a pair of APs 
is a constant during the observation period, under the random 
channel state setting, we let X a in (O be an i.i.d Gaussian 
random variable with zero mean and 3dB variance across the 
observations. We repeat the simulation procedure under this 
random channel setting, and evaluate the performance of the 
robust direct interference estimation algorithm with e = 0.007. 

We plot the median average observation time over 10 
topologies required to correctly recover Go in Fig. [6] The 
worst case is also plotted. Next, we plot the median and worst 



Fig. 5. The observation duration required to hidden interference graph for 
topologies with 4x8 cells, plotted as a function of maximum degree s. 
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Fig. 6. The observation duration to recover direct interference graph for 
networks with d = 5 and s = 2 under random channel state setting, plotted 
as a function of the number of cells in the network. 

observation times required to recover Gd as a function of the 
maximum degree of the network in Fig. Q The plots show 
the scaling that was predicted by the theory under the random 
channel states setting. Again, we observe that he necessary 
observation duration scales sub-linearly with network size n, 
and super-linearly in d. Compared with the simulation results 
under the static setting, we note that it takes much longer 
observation time to successfully recover Go in the random 
channel scenario. This is consistent with the theoretical results 
on the upper bounds. 

VI. Conclusions 

In this paper, we proposed passive interference learning al- 
gorithms and analyzed their learning bounds for both static and 
random settings. For the static setting, we first upper bounded 
the number of measurements required to estimate the direct 
interference graph. Then, we provided a minimax lower bound 
by constructing a sequence of networks and transforming it 
into an M -ary hypothesis test. The lower bound matches the 
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Fig. 7. The observation duration to recover direct interference graph for 
networks with 4x8 cells under random channel state setting, plotted as a 
function of maximum degree d. 



upper bound (up to a constant), thus the bound is tight and 
the algorithm is asymptotically optimal. We then analyzed the 
hidden interference graph estimation, and provided matched 
lower and upper bounds following similar approaches. For the 
random setting, we proposed robust graph inference algorithms 
to estimate the direct and hidden interference, and presented 
upper bounds on the number of measurements required for 
reliable estimations. We also presented an experimental study 
that lends support to the theoretical analysis. 



Appendix 



A. Proof of Theorem [7] 



Consider any two nonadjacent nodes i,j in Go = (V,£d). 
Let Nij = Mi U Mj, and Mi\j = /Vj n Nf. nodes that are the 
neighbors of node i but are not neighbors of node j. 

Under the CSMA/CA protocol, transmitter i only contends 
for the channel when Qi = 1. For ease of analysis, in this 
proof, we assume that every transmitter competes for the 
channel at the beginning of each session no matter whether 
or not Qi = 1. Then, if transmitter i gets the channel, it 
starts transmit if Qi = 1. Otherwise, even thought node i has 
the opportunity to transmit, it keeps silent. The latter event 
leaves the opportunity open for one of its neighbors with a 
longer back-off time to transmit. Since this does not change 
the outcome of transmission pattern given that the back-off 
times of those transmitters with nonempty queues are fixed, 
the statistics of X := (Xx, X2, . . ■ , X n ) stay the same under 
this interpretation. 

Define Ttyi as the minimum back-off time of the nodes in 
the set Mi. Then, 



(Ti < , Tj < IV, 



\Mi\l- \Mj\ 



,1 

% I " 



(8) 



> 



1 



{di + l){dj + 1) 



> 



1 



The probability in ([8]) is obtained in this way: Considering 
nodes i, j and their neighbors, there are \Mij \+2 nodes in total, 
and there are (\Mij \ + 2)1 orderings of their back-off times. 

Among these orderings, (ju^j+i) ' l-^l' ' l-^j'vl' orderings 
correspond to Ti < T_\f { ,Tj < Tjy..^.. Such ordering can be 
obtained in this way: suppose these IJVyl+2 nodes are ordered 



according to their back-off times. Node i and its neighbors 
take \Mi\ + 1 positions in the ordering. There are ('n^j'+i) 
different combinations of positions. Since Ti < TV^ node i 
takes the first position out of the chosen \Mi\ + 1 positions, 
the remaining \Mi\ positions are for its neighbors. This results 
in \Mi\l orderings for each combinations of positions. Node 
j and the nodes in Mj\i take the rest of the positions, where 
node j takes the first. This gives us the factor |7V}\j|!. 

When Ti < T/v 4 , and Qi = 1, based on the protocol, 
node i gets the channel and thus Xj = 1. At the same time, 
transmission from all nodes in Mi are suppressed. Therefore, 
for node j, if Tj < Tj\f... and Qj = 1, node j also gets a 
channel. Thus, we have X. t = Xj = 1. Since other scenarios 
result in Xi = Xj = 1, we have the following 

F(X i = l,X J = l) 

> P (Ti < T Ni , Tj < T M . Ki , Qi > 0, Qj > 0) 

= P [Ti < T Mi , Tj < 7V. V ) F(Q l > 0, Qj > 0) > ^ 
P(edge (i, j) is not removed with one observation X) 

= l-P(X i = l,X J = l)<l-^. 

Define Aij as the event that edge (i, j) is not removed after k 
observations. Then, the probability that, after k observations, 
the graph cannot be identified successfully is 

k 



l- p - 

d 2 



The inequality follows from the fact that the number of 
nonadjacent pairs in Go = (V,£d) is upper bounded by (™). 
Under the assumption that d n, this is a good approximation 
for the nonadjacent pairs in Go- 

B. Proof of Theorem [2] 

First, we define Qd as the set of graphs consisting of n 
nodes that have maximum degree d — 1. We are going to 
construct a collection of M +1 graphs {Gpo, Gjji,..., G^m} 
where Gbi G Qd f° r a U We denote the distribution of 
transmission patterns x £ {0, 1}™ for each of these graphs as 
P (x), -Pi(x), . . . ,P M (x), respectively. Define F(A;G D ) to 
be the probability of event A occurring where the underlying 
direct interference graph is Gd € Qd- A metric of interest is 
the edit or "Levenshtein" distance between a pair of graphs. 
This is the number of operations needed to transform one 
graph into the other. We denote the Levenshtein distance 
between Gm and Goj as D[ J (GDi 1 GDi)- Then, we use the 
following theorem to obtain the lower bound. The theorem is 
adapted from |[T31l . 



Theorem 7 Let k e ^ 

such that 



M>2, {G m ,...,G DM }eg d be 
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D L (G Di ,G Dj ) > 2r, forO<i<j< M, where D L is 
a semi-distance, 

Ta Efli Dkl&W < alogM, with < a < 1/8. 



Then 



inf sup P(D L (G D ,G D )>r- 7 G D ) 

> inf m&xP{D L {G D ,G Dt )>r;G Dt ) 

G D £Q d 1 



>M 2a 
> — — = I I -2a- J- — > 



i + Vm 



log M 



In the following, we construct the M + 1 graphs 
{Gdo, Gdi, ■ ■ ■ , Gdm} and leverage Theorem [7] to obtain a 
lower bound on k. We first construct Gdo and characterize 
Po( x )- Then, we construct the rest of the M graphs by 
perturbing Gdo- These graphs will be symmetric in the sense 
that D KL (Pi\\P ) will be the same for 1 < i < M. We 
calculate Dkl(Pi\\Pq) and then lower bound the number of 
required observations k. 

1) Gdo and its transmission pattern distribution Pq(x): 
We let graph Gdo consist of \nj (d— 1)] disconnected cliques. 
The first \n/(d — 1)J := mo cliques are fully connected 
subgraphs with d — 1 nodes, as shown in Fig. [8] The last 
clique is also full connected, and the number of nodes is less 
than d—1. In the following analysis, we focus on the first mo 
cliques. 

Denote C m as the set of nodes in the mth clique and let Xc m 
be the restriction of the transmission pattern X to the nodes in 
the clique (Xc m is a set of sub vectors, m = 1, 2, . , . , mo that 
partition X). Define as the unit vector of dimension d—1 
whose ith element is one. Define as the d—1 dimensional 
all-zeros vector. Due to the fully connected structure of the 
cliques, no more than one node in any C m can transmit at any 
time. We denote the set of all feasible transmission patterns for 
clique m as X m := {0,e 1; . . . ,eic m i}. If Xc m = 0, then no 
node in C m is transmitting in that session. For each individual 
clique, this event happens only when none of the nodes in that 
clique has traffic to send, i.e., 

0;G D o) = (l-p) d - 1 - 



B (X C 



(9) 



Otherwise, when at least one AP has a packet to send, 
the channel will not idle. Because the QiS and T^s are i.i.d. 
across nodes, and the clique is fully connected, each node in 
that clique has the same probability of occupying the channel. 
Thus, for any j, j e {1, 2 . . . , \C m \}, 

{1-vY 1 - 1 



\X Cm =e j ;G D0 ) = 



1 



q, 



(10) 



Since the behavior of the cliques are independent, we have 



mo+l 

n p ( x ^ 

m— 1 



x e m ; Gdo)- 



(11) 



2) Construct M — n graphs: In this subsection, we 
construct a sequence of graphs Gdi, Gd2, ■ ■ ■> Gdm- We 
will wan to construct n graphs, i.e., M = n. We construct 
each graph by picking a pair of nodes from distinct cliques in 
first mo cliques in graph Gdo- We add an edge between the 



selected pair. We leave the last ((mo + l)th) clique unmodified 
for G D i, ■ • ■ , Gdm- We can construct (™°) (d - l) 2 = (d - 
l) 2 mo(mo — l)/2 different graphs in this manner. Under the 
assumption that 

d-l<n/2, (12) 

and the fact that mo = Ls~rJ — ~3~~[ — 1, we have 

{d-lf . „ {d-lf 
K -— ^m„(m - 1) > y —— -L- 



- 1 



1 



(d-lf-^{d-l) + ^- 



1 



- 2 



(13) 

This is a quadratic function of d — 1 for any fixed n. We want 
to have M = n and the value of $13[ equals n if 



d-l = 



3n ± v7? 



16n 



Since d — 1 < n, and \ln 2 + 16n > n, only the sm aller 
solution is feasible. When n > 5, we have 3 "~ v/ " a+1 — > 1, 

— 4 

thus the assumption d — 1 > 1 is satisfied. Meanwhile, since 

2>n — n n 

Z 2' 



3n — y/n 2 + 16n 



< 



(14) 



4 4 
it is a tighter constraint on d — 1 than (fT2)l . Therefore, under 
the condition that 



n > 5, d- 1< 



3?i — \J n 2 + 16n 



we have (d — l) 2 mo(mo — l)/2 > n and thus we can always 
pick n graphs that are perturbations of Gdo in the above sense. 
We note that for each of these graphs DL(GD0,GDi) = 1, and 
D L (G Di ,G D j) = 2 for any < i, j < M where i ^ j. 

3) Gdi and its transmission pattern distribution Pi(x): 
We now calculate -Pi(x), which differs from -Po( x ) due to the 
added constraint resulting from the additional edge. Due to 
the symmetric construction of the M graphs, without loss of 
generality we concentrate on a single graph. Let Gdi be the 
graph formed from Gdo by connecting the ith node in C\ to 
the jth node in C2 with an edge. Since the remaining mo — 1 
cliques are unchanged, the distribution of their transmission 
patterns Xc m is the same as under Gdo- Furthermore, they 
are independent of each other and of (Xd,Xc 2 ). Thus we 
express the transmission pattern distribution under Gdi as 

Pi(x) = P(X Cl = x Cl ,X C2 = x C2 ;G m ) 

mo + l 

• J] P(X Cm =xc m ;G D0 ). (15) 

m— 3 

We want to calculate the KL-divergence between Po( x ) 
from (fTTT i and -Pi(x). The final mo — 2 terms of both are 
identical. Thus, for the remainder of this subsection we focus 
on the distribution of the activation pattern in the first two 
cliques, i.e., P(X Cl = x Cl ,X C2 = x C2 ;G_di). 

The differences between P(Xd = xc 17 Xc 2 = xc 2 ;Gdi) 
and P(Xcj = xc lf Xc 2 = xc 2 ;Guo) are due to the added 
edge, which constraints the allowable patterns. In particular, 
the event = e;,Xc 2 = e 3 , which occurs with nonzero 
probability under Gdo, cannot occur under Gdi- If, under 
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Fig. 8. A network consists of n/(d— 1) cliques, where nodes in each clique 
are within each other's carrier sensing range. Nodes belonging to different 
cliques cannot hear each other, n = 20, d = 6. 

Gdo both nodes i and j would have transmitted, under 
Gdi only one will transmit. The carrier sensing mechanism 
will suppress the transmission of the other. Which node will 
transmit and which will be suppressed will depend on the 
respective back-off times. The result will be an increase in 
the probability of some other node transmitting, since the 
suppressed node will not compete for the channel. 

To get the analysis rolling, we first consider the simpler 
situations of joint transmission patterns (xc x , xe 2 ) where there 
is no possibility that transmission by node i could have 
suppressed transmission by node j (or vice-versa). For these 
situations the probability of the joint pattern under Gdi is the 
same as under Gdo- There are three such cases. The first case 
consists patterns such that (i) some node in Ci other than i 
transmits and (ii) no node in C2 transmits, i.e., G <%'i\e i 
and X£ 2 = 0. Condition (i) means that node i could not have 
suppressed the transmission of node j. Thus, but condition (ii) 
no node in C2 has any data to transmit. Therefore, 

P(X Cl =x Cl ,X C2 =0;G m )=P(X Cl =x Cl) X C2 =0;G™)- 

The second case is the reverse of the first, i.e., (i) xc 2 G X2\&j 
and (ii) x^ = 0. By the same logic, P(Xd = 0, X^ 2 = 
*c 2 ': G di) = P(X Cl = 0, X C2 = x C2 ;G D0 )- The third case 
consists of situations where (i) transmissions occur in both 
cliques, but (ii) neither i nor j transmit. Condition (ii) implies 
that suppression of node j by node i (or i by j) could not 
have occurred, and thus the probability of the joint pattern 
under Gdi is the same as under Gdo- These conditions are 
summarized as Xd G <Yi\{0,e.J and xc 2 G A?2\{0, 
Thus, 

P(X Cl = x Cl , X C2 = x C2 ; Gdi ) = P(X Cl = x Cl , X C2 = x C2 ; G D o ) 

Finally, by dTOb and ( fTTT i we know that P(X Cl = x Cl , X Ca = 
xc 2 ; Gdo) = q 2 - 

We now turn to transmission patterns where nodes i and 
j may have interacted. These are patterns (xc l; xc 2 ) the 



probability of which may be higher under Gdi than under 
Gdo due the the infeasibility of the pattern (e,, e^) under Gdi- 
To analyze these cases consider the paired queue/backoff- 
time vectors (Q,T) that, under Gdo, would have resulted in 
(Xd , Xc 2 ) = (ej, ej). Depending on the particular realization 
of (Q, T) there are four possible outcomes under Gdi- 



(a) Ti < Tj and Qc 2 = e 3 : Node i will get the channel and 
transmit so Xd = However, the transmission of node 
j will be suppressed. Since Qc 2 = ej, node j is the only 
node in C2 with data, so Xc 2 = 0. 

(b) Ti > Tj and Qc 2 = e 3 : The analysis here is analogous 
to (a) with the roles of i and j reversed. Thus, the 
transmission pattern will be X^ 1 = 0, Xc 2 = e^ . 

(c) Ti < Tj and Qc 2 7^ e^: In this situation at least one 
other node in C2 has data to transmit. Thus, even though 
Xci = Bj suppresses the transmission of node j, some 
other node in C2 will transmit. Thus, Xd G Xi\{ei, 0}. 
Furthermore, since the statistics of the queue status and 
backoff times are identical for all nodes, the transmitting 
node in C2 will be uniformly distributed across the other 
d — 2 nodes in that clique. 

(d) Ti > Tj and Qc 1 7^ e^: The analysis here is analogous 
to (c) with the roles of i and j reversed. Xc 2 = e^, and 
Xd is uniformly distributed across G Xi\{e.i,0}. 



We note that the above four cases partition the event space 
where nodes i and j transmit concurrently under Gdo- The 
four terms in the following correspond, respectively, to (a)- 
(d), above: 

P(X Cl =e i ,X C2 =B j ;G D0 ) 

= P(X Cl = e,, X C2 = ej,Ti < T,,Qc 2 = &j] Gdo) 
+ P(X Cl =e,,X C2 =e j ,T i >T j ,Q Cl =e t ;G D o) 
+ P(X Cl = e i; X C2 = ej ,Ti < T 3 , Q C2 £ e r ,G D0 ) 
+ P(X Cl = e h X C2 = ej,Ti > Tj, Q Cl ^ e,; Gdo) 

= 20i + 2&. (16) 

The first two terms are equal due to the symmetry of the 
conditions and graph structure. Similar logic implies that the 
third and fourth terms are also equal. We respectively define 
/?i and /?2 to be the two probabilities. 

We now consider the four cases of joint transmission 
patterns (xc 15 xc 2 ) under Gdi not yet considered. These 
will each connect to one of the cases (a)-(d), above. First 
considered the probability of pattern (xc i; xc 2 ) = (ej,0) 
under Gdi- The probability of this pattern under Gdi will 
be larger than under under Gdo since certain pairs (Q,T) 
that result in (e^e^) under Gdo result in to (e,,0) under 
Gdi- The probability of observing (ej,0) under Gdi equals 
the probability of observing that pattern under Gdo plus the 
probability of the event occurring that was considered in case 
(a), above. This latter event is the bump in probability due to 
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the interaction of i and j. Therefore, we find that 

P(X Cl =e i ,Xc 2 =0;G D1 ) 
= P(X Cl = ei! X C2 =0;Gz3o) 
+ P(X Cl = ei ,X C2 = ej , T l < T 3 ,Q C2 = ej ; G D0 ) 

d-X 



(17) 



where the first term follows from the independence of the 
node-wise transmission patterns under Gdo, (TTOb , and the 
fact that no nodes in fa have data to transmit. We defer the 
calculation of fa = P(X Cl = e.,,X C2 = ej,T < Tj,Qe 2 = 
e j',GDo) till ??. Following a similar line of argument for 
(xc 15 xc 2 ) = (0, ej), and considering case (b), we find that 

P(X Cl =0,X C2 =e J ;G D1 )=q(l-p) d - 1 +(3 1 (18) 

The third case concerns the patterns (ej,xe 2 ) f° r a ll x c 2 G 
X-2\{0, ej}. Similar to (a) and (b), the probability of the 
pattern will equal the probability of the pattern under Gdo 
plus a boost due to the infeasibility of the (ei,ej) pattern 
under Gdi- The boost corresponds to the event discussed in 
(c), above. For any xc 2 G X2\{ej, 0} we find that 

P(X Cl =e. i ,X C2 =x C2 ;G m ) 

= P(X Cl = ej, X C2 = x C2 ; Guo) 
1 



2 



1 



5 



d- 2 



»(X Cl = ei ,x c 
ft. 



ej , T, < Tj , Q C2 7^ ej ; Gdo) 
(19) 



The factor of l/(d — 2) in the second term results from the 
uniformity over the other d — 2 transmission patterns in C2, 
mentioned in (c). The probability fa — P(Xd = ej,Xc 2 = 
ej,Ti < Tj, Qc 2 7^ ej; Gdo will be calculated in d22l i. Finally, 
by symmetric logic, we find that for any xq 1 G A?i\{e,, 0} 

1 



"(Xfii — x Cl , X C2 — ej ; Gdi j 



9 



d- 2 



ft 



(20) 



We now calculate fa, required in (T7\ and dT8T >. We start 
by rewriting the first term ( fTo*b using Bayes' rule as 



e l , Xc 2 = ej , T> < Tj , Qc 2 



Gdo) 



I] Gnaj = p(l — p)'- The second factor is just the fraction of 
the dl orderings such that there are / nodes in C\ with Tk < T 
and such that node T < Tj. The number of such orderings is 

' d 7 2 )Z!( d -;- r 
pieces together we find that 

P(X Cl = e„ X C2 = e 3 , Ti < Tj, Q c 

p*(l-p){<i-z) 



;H 1 )(d-l- 2)! = (d-2)l(d-l-l). Putting the 



e j] Gdo) 



d-2 

E 

.1=0 



(d-i- i)(i- P y 



d(d-l) 



ft- (21) 



And, since transmitters i and j have the same statistics, when 
T > Tj, we also have 

P(X Cl = 0, X C2 = ej ,Ti > Tj, Q Cl = e i5 Gdo) = ft, 

which justifies (1181) . 

Finally, to calculate /?2 we simply combine ( fT~6b with (f2Tb . 

P(X Cl = ei ,Xc 2 =e J -;Gr, ) = 2ft + 2(d - 2)/3 2 = g 2 , 

where the final inequality follows from the independence of 
Xcj and Xc 2 under Gdo- Thus, 



ft = 



2ft 



(22) 



4) Bounding Dkl{Po\\Pi) : The Kullback-Leibler diver- 
gence between the transmission pattern distributions under 
Gdi and Gdo, denoted as P±,Po, respectively, can be cal- 
culated as 

Dkl(Pi\\Po) 

xe{o,i}" v ' 

E p ( x d 

xc 1 e{oa} d - 1 ,xc 2 e{o,i} d ^ 1 

P(X Cl = x Cl , X c , 



log- 



Xd,X C2 — x C2 ; Gdi) 
xc 2 ; Gdi) 



»(X Cl = x Cl ; Gdo)P(X C2 = x C2 ; Gdo) 
2P(X Cl =e,,X C2 =0;Gdi) 

P(X Cl =e t ,X C2 =0;Gdi) 



(23) 



log- 



»(X Cl =e j; GDo)P(X C2 =0:G 



D0j 



Bi.Ti < T j ;G D0 )¥{X C2 = e j: Qc 2 = ej;G D0 ). 



2(d-2)P(X Cl = ei ,X c 



G 



In the application of Bayes' rule we have used the fact 
that P(X Cl = e h Ti < Tj\Xe 2 = ej,Qc 2 - e,G D o) = 
P(Xc! = ej,Tj < T,;Gdo), due to the independence of 
transmission patterns under Gdo- Now, note that P(Xc 2 = 
e i,Qc 2 = ej;GDo) = P0- ~ p) d ~ 2 as node j is the 
only node in fa with something to transmit. Next, rewrite 
P(X Cl = ej, < Tj] Gdo) as 

d-2 

P(X Cl = ei, T < Tj, \{k G ft : T fc < T}| = Z; Gdo) 

z=o 



log- 



P(X Cl 



; Gdi) 



= 2[g(l-p)' 1 - 1 + /? 1 ]log 



X Cl =e. i; GDo)F(X C2 = 

g (l-p)*-i + /3 



ej'; Gdo) 



< 2 



2(d-2 
1 



ft 



9 (l-p) d -i 
2 1 gg 

d-2 



d-2 



lo 



1 



(24) 



(25) 



(26) 



d-2 

E' 

1=0 



'(x Cl 



T < Tj, G ft :T fc <T}| =Z;Gdo 



"(Tj < Tj, |{fc G ft : T fc < Tj}| = /; G 



D0J 



The first factor 



T < Tj, |{fc G ft :T fc <Ti}| 



[d-1) 2 

In d23l we cancel (and marginalize over) the mo — 1 common 
factors of the form P(Xc m = Xc m ; Gdo), 3 < m < mo + 1 
cf. ( fTTT i and ( TTBI l. In ( l24l i. we focus in on the terms that 
don't cancel out. The first two terms therein correspond to 
cases (a) and (b), cf. (O and (O. The latter 2(d - 2) 
terms correspond the cases (c) and (d), cf. ( fT9l and (|20T > 
where j' is some node j' G ft but j' 7^ 7. In $25[ we 
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use ( fTTT i and fl9[ in the numerators and (0 and ( fTTT i in the 
denominators. Finally, substituting in for the definitions of /?i 
and fa from ( f2TT > and d22l i. equation ( f26l > follows from the fact 
that log(l + x) < x. 

5) Put pieces together: Since the Gi^s are constructed in 
the same manner, Dkl (-Pj||-Po) = ^Ki(-Pill-Po) for all i. 
Thus, we have 

1 M / 1 \ 1 

-g^ i( «llft)<(2 + — J^^. 

In summary, we have D^(G Di,G Dj) > 1 for < i < 
j < M and we can always pick M = n. Thus, according to 
Thm. [7] when 

< alogn 

( 2 + T^) JdTW 

we have 

inf sup F(G D ^ G D ;G D ) 

= inf sup P(^(G D ,G fl )>l/2;G D ) 

l + V^V V log"/ 

C. Proof of Lemma [7J 

Since = must be caused by some active interferer, 
Sj H Sj for every £ € fcj(k)- Therefore, Sj is a hitting 
set for {S)} teK][k) . 

Next, we prove that Sj is the unique minimum hitting set. 
We prove this through contradiction. Assume that there exist a 
different hitting set Sj(k) with |Sj(fc)| < |«Sj-|. Since Sj(k) is 
different from Sj, there must exist a node i <G Sj, that is not in 
Sj(k), i.e., i E Sj\Sj(k). Consider the following probability 

F(X i = l,X j = l,Y j = 0,X Mk) =O) 
= F(X i = l,X j = l,X i . {k) =0) 

■F(Y j =0\X i = l,X j = l,X s . {k) =O) 

>F(X i = l,X j = l i Q SjW =°) 

■F(Y j =0\X i = l,X j = l,X Mk) =O) 

> ^(l-pf i{h)l Pij > J(l -P) s p mi n, (27) 

where ( f27T > follows from the assumptions that i G 5j, 
< |«%| < s, and ©. 
For any observation with Xi(t) = l,Xj(t) = l,Yj(t) = 
0, = 0, its time index t belongs to K.j(k), however, 

5j n Sj(k) = 0, which contradicts with the assumption that 
Sj(k) is a hitting set. Since this event has a lower bounded 
probability for any fixed d, s, as k — > oo, this event happens 
with probability one. Therefore, Sj is the unique minimum 
hitting set as k — s- oo. This proves the lemma. 

D. Proof of Theorem \3\ 

Define the error event Cj as the event that the estimated 
minimum hitting set Sj is not equal to Sj after k observations. 



This only happens when \Sj \ < \Sj\, because otherwise Sj is 
a feasible hitting set with smaller size. So when \Sj\ < \Sj\, 
there must exist at least transmitter i 6 Sj that is not included 
in Sj. Then, following the steps in the proof of Lemma [T] we 
have 

F(C j )=F>(U ieSj i<£S j ) 

< (1 - HXi = 1, Xj = h Yj = 0, X^. - 0)) k 

Therefore, 

F(G H ^ G H ) = F(UjCj) 

= ns { 1 & J ' 

E. Proof of Theorem |4] 

To prove Theorem |4] we follow a similar approach to 
that taken in the proof of Theorem [2] For any given direct 
interference graph Gd = (V,£d), we define Q s to be the set 
of hidden interference graphs satisfying the assumption that 
Sj < s for every j. We construct a collections of graphs, 
Gho, Ghi, • ■ • , Ghm, all in Q s , and reduce the interference 
estimation problem to an M-ary hypothesis test. These graphs 
share the same node set and direct interference edges, however, 
the hidden interference edges differ. With a little abuse of 
the notation, we use Pi (x, y) to denote the joint distribution 
of transmission pattern x and feedback information vector y 
under G D and G Hi , < i < M. We use F{A\ G H i) to denote 
the probability of event A under distribution -P;(x, y). Note 
that P(A;Ghi) implicitly depends on the underlying direct 
interference graph Gd- 

1) Construct Gd and Gho- We now construct an under- 
lying direct interference graph Gd, and add directed edges to 
form Gho- An illustrative example of a possible Gd is pro- 
vided in Fig. [9] We partition the node set into \nj (2d+s— 1)] 
groups. The first ln/(2d + s— 1)J groups consist of 2d + s — 1 
nodes. The last group consists of the remaining nodes. In each 
group, except the last, we cluster 2d nodes into a pair of 
cliques, each clique consisting of d nodes. The remaining s — 1 
nodes are "independent" nodes or "atoms", disconnected from 
all other nodes in the network. Thus, their activation status 
depends only on their own queue status; it is independent of 
everything else. 

We construct Gho by adding directed edges to Gd- These 
edges will be added only between nodes in the same group. 
Hidden interference will thus exist only among nodes within 
the same group. It will not exist between groups. To get the 
hidden interference Gho consider each node in each clique in 
each group. Let all s — 1 independent nodes in that same group 
be hidden interferers as well as one (any one) node in the other 
clique in the same group. Thus every node in each clique has 
exactly s hidden interferers. We note that a node in a clique 
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Fig. 9. A direct interference graph Go of n/(2d + s — 1) = 2 groups, 
where each group consists of two fully connected cliques of size d and s — 1 
detached APs; n = 22, d = 5, s = 2. 



is allowed to interfere with more than one node in the other 
clique. The last group can have an arbitrary edge structure as 
long as it satisfies the maximum degree constraints. 

Part of the hidden graph structure are the probabilities 
Pij, defined in ([T), of whether hidden interferer i e Sj 
interferes with the transmission of node j. We now specify 
these probabilities for Ghq- For all i G Sj and S C V\{i,j}, 
the hidden interference satisfies 



W(Yj = Q\Xi = l,Xj = 1,X 



V\{i,j} 



0;G 



HQ) 



V(Y 3 = 0\X, =l,X j = l,X s = x s ; G m ) = P* 



(28) 



where xs is any transmission pattern feasible under the direct 
interference graph Gd- Thus, in contrast to the inequality (f2]i 
in the general setting, for this network the bound holds with 
equality for all i E Sj. The implication is that the transmission 
collision probability for an AP j doesn't increase if there are 
more than one hidden interferer transmitting. This assumption 
holds for every hidden interference graph Gni discussed in 
this section. 

2) Construct M hidden interference graphs: We 
now construct a set of hidden interference graphs 
Ghi,Gh2, ■ ■ ■ ,Ghm as perturbations of Ghq- We get 
each by removing a single directed edge in Ghq- The edge 
removed must have connected a pair of nodes in distinct 
cliques in a group in Ghq- That is, we do not remove an 
edge between an independent node and a node in a clique. 

In each group, there are 2d such edges. There are thus 
2d[n/(2d+s — l)\ distinct edges in Ghq that can be removed. 
If 

d < c\n, s — 1 < C2T1, and 2c\ + c<z < 1, (29) 

where c\ , C2 are positive constants, there are more than 
2ci( 2ei 1 fC2 — l)n := M such edges. For each of these 
graphs, D L (D m ,D Hi ) = 1 and D L (D m ,D H j) = 2 where 
1 < i,j < M. 

3) Characterize Dkl(Pi\\Po)-' Without loss of generality, 
we assume that the directed edge is removed from Ghq 
to form Ghi, where i G C\, and j G C 2 . Since we are removing 
an edge from Ghq to get Ghi, we assume property (f28T > is 
inherited. Thus, the only difference between the distributions 



under Ghi an d Ghq occurs when = ej,Xc 2 = e^, and 
Xsj\t = 0- Specifically, 

F(Yj = 0|X Cl = a, X C2 = ei ,X SAi = 0; G H q) = Pmin, 

while 

P(Y 3 = 0|X Cl = e 4 ,X C2 = e,,X 5Al = 0;G m ) = 0. 
Therefore, 

^L(Piii^o) = E p i( x 'y) 1 °g ; &r 1 4 

p o(x,y) 

E^Wfi(y|x)iog ^| x i^ x i 
P (y|x)P (x) 



\A;G m 



y 3 e{Q,l} 



MvM) 



- P y 



(l-pJ'^logCl-jw) 



where Pi(x) = Po( x ) since Gd is held fixed, .4. := 
{X Cl = ei ,X C2 = e,-,X s . v = 0}. V(A;G H1 ) = P(X Cl = 
e 4 ;G Hl )F(X C2 = e j; G m )P(X 5jV = 0;G m ) because d, 
C2 and Sj\i are disconnected in Gd- V(A;Ghi) is then 
calculated following similar steps to obtain (fTTT i. 

4) fwf f/ze pieces together: In summary, we have M = 

2c i(2SIT^ 1 - 1 )» and D L (G m ,G Hj ) > 1 for < i < j < M. 
Theorem [4] is thus proved by application of Theorem [7] 



F. Proof of Theorem \5\ 

The probability that, after k observations, the graph cannot 
be identified successfully is 

P((WG fl )=P(u (jiJ - )eVxV Aij) 

where Aij is the error event associated with any AP pair 
There are two types of errors, namely, type I errors A\j ("false 
positive" or "false alarm" errors) and type II error A{J ("false 
negative" or "missed detection" errors). In the following, we 
provide on the bounds on these two types of errors. 

Under the noisy carrier sensing channel assumption, for 
edge G £ D , 

Ifj :=V(X i = l,X j = l\(i,j)€E D ) 

< P {Q t > 0, Qj > 0) q l3 

< p 2 q\ow 

where the upper bound corresponds to the situation that 
is the only edge in En associated with AP % or AP j, and the 
factor qij comes from the fact that APs i and j should not 
sense each other in order to transmit simultaneously. Since 
G £d, q^ < q\ow, and the last inequality follows. 
For the remaining AP pairs (i.e., non-edges), 



1° 



> P (Ti < T Mi , Tj < T Mj ,Qi>0, Q 3 > 0) q rj 



> 



P 2 gup 

d 2 ■ 
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Then, by Assumption [2] we have 

//, • < • /;.;• oo) 

We note that Assumption [2] serves as a sufficient, but not 
necessary, condition for (f30b to hold. This makes it possible 
to distinguish between edges and non-edges based on the 
probability of simultaneous transmission. 
Applying Hoeffding's inequality, we have 



Iij <e 
I 



D 



< exp 



D 



rf 2 



2 

el fc 



and 



(^lf/)=P(i ij >e (i,j)Gf 



4> e 



4 



G £d) 



< exp {-2(e - p 2 q\ ow ) 2 k) 



Thus, 



P(Ay) < max{P(A( ? -),P(A(/)} = exp{-2e 2 fc} 

where we recall from the theorem statement that e = 
P 9 " P 2rf^ 1 ^ C3 ' ) • Thus, the probability that, after k observations, 
the graph cannot be successfully identified is 

HGd + G D ) = P( U (lj)eVxV Aij) (31) 
< n 2 exp{-2e 2 fc}. (32) 



G. Proof of Lemma [2] 

First, based on the definition of hidden interference in ([TJ 
and the assumption made in (f2]i, for any i G Sj, we have 

F{Y j = 0\Xi = l,Xj = l,X SjV = 0,X^ = 0) > Pij . 

On the other hand, for any i G V\{5jUA/j}, since is the only 
source of interference for AP j when A"s = and Xjj. = 0, 
and we assume p$j is independent of the status of APs in the 
network, we have 

F(Yj = 0\Xi = 1, X, = 1, X S] = 0, X Nj = 0) 

= p(y-- = o|a, = i,x v \ {j} = o) = p0 3 -. 

Thus, the true hidden interferer set Sj satisfies the two 
properties described in <j4j and (0. We need to prove that this 
set is unique, i.e., there does not exist another set Sj ^ Sj 
satisfying those two properties. 

We first show that Sj C Sj. If this were not the case, there 
must exist a node i G SAS'a. Since Xc< is a interferer set, 
according to (|5), we have 

F(Yj = 0\Xi = Xj = 1, X s , = 0, X^/-, = 0) = p 0j < py, 

which contradicts the assumption that i G 5j, and 

P(Y^ = 0|X ? = = 1,X S , = Q,X Nj = 0) > pij. 

We now show that 5j = <Sj . If this were not the case, there 



must exist a node i G S'j\Sj, According to ©, we have 

F(Yj = 0\Xi = X, = 1, X sjV = 0, X M] = 0) > py. (33) 
However, since i ^ 6>j then, according to (0, we have 

F(Yj = 0\Xi = Xj = 1, X s;v = 0, X Mj = 0) 
= P(YJ = 0|A, = = 1, X s . = 0, A^ 3 = 0) 
= P0j- 
which contradicts d33l . 



//. Proof of Theorem [6] 

Before we proceed to prove the theorem, we introduce the 
following lemma which will be used to prove the theorem. 



Lemma 3 Let (Ui, W{) for i = 1, 2, . . . , k be a length-k i.i.d. 
sequence of a pair of random variables, each having support 
[0, 1] and respective mean E\Uj\ = up and E[W{] = Wp. 
For any 5 > the sample averages U — ^2i=i Ui and 

W = (lA')Eti^ satisfy 



< exp ■ 



k(6(w a - S w ) - ^5 w f 



where 6 W is any parameter satisfying < S w < wq and 8(wq- 

s w ) - ^s w > 0. 

Similarly, 



W w 



U u 

. r k(S(wo + 5 w )-^ o 6 w ) 2 } r M " w \ 

< exp | 2 j + exp | - -^j 



kSl 



where 8 W is any parameter satisfying 5 W > 0, 6(wq + 5 W ) — 
^5 W > 0. 

WQ W 



Proof: 



W w 



U up 

W w 
U up 
W wo 



> S, W > w - 5, 
> 8, W < w - 5, 

Up 



<F[U-u Q > 6(w - S w ) - ^6 W + P [W - w < S h 

Wo 



< 



exp 



r k(5(wo-S w )-^S w )* } r k Sl X 

X 2 l +exp t-— I' 
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The last inequality follows from Hoeffding's inequality. Fol- 
lowing similar steps, we have 

W~— <S,W>w + 5 v 
W w 

+p(^f <S,W <w + S u 

<v(u-u < -6(w + 8 W ) + ^-8 w ^j + f(w > w + 5 % 

<exp( j +exp |__|. 



For the hidden interference graph learning problem in the 
random setting there are three sources of error. The first is 
incorrect estimation of Gjj, the direct interference graph. We 
denote this event Fq. The other two are missed detection of an 
edge and false detection of an edge (a "false alarm"). Define 
Tij to be the error event associated with given that the 
input Go is correct, i.e., We denote the type I error by 
T{j and the type II error by T\, 



ij 



Set 8 to be in the range p$ < 8 < p m i n . Based on analysis 
above, we now examine the type I and type II errors. 



p(^) = pas,,- > s\ii Sj) 



mm 



scvUJVMij}} W ijtS 

\S\<d 



>8 



ij.Sj 



u, 



w, 



< exp | — 



(37) 



< 



exp 



oxp{-^&} (38) 
K«- w K-«.)'} + ^,f_a}, (39) 



where 8 W is parameter satisfying (8 — p$)w° — 88 w > 0, 
follows from Lemma [3] and d39t follows from Assumption 
A7 and tO- 



Define 

Uij, s : = l{Yj = 0, X s = 0, X Nj = 0, X, = Xj = 1} 
W ijtS ■ = i{x s = 0, X M] = 0, Xi = Xj = 1} 

where S C V\{Mj U {i,j}}, \S\ < d. Then, U ijtS and W^s 
are two random variables. Each has independent and identical 
distribution across k observations. Denote their mean values 
as m° s and s , respectively. Note that 



7/1° 

w ij,S 



\X s =0,X Nj =Q,X i = X j = l) 



> (l-py(l-q iow ) d P 2 /d 



j2 -w°. 



(34) 
(35) 



According to the definition of hidden interferers and 
Lemma |2] if i $ Sj, 

,.o 

mm — ^ — 

scv\{Af,u{i,j}} w^ iS 

\S\<d 

:= min P(K- = 0\X S = 0, Xsr. = 0, X, = X, = 1) 

\S\<d 

= P(Y} = 0|X S . = 0, X Mj = 0, Xi = X = 1) 



L ij,S 3 



w 



o =P$, S P$- 

Otherwise, if i £ Sj, 

mm — — 

\S\<d 

= P(Y- = o\x SA i = o, X Mj =0,X l= X, = 1) 

„,0 



ij,Sj\i 



w 



q — Pij — Pmin 

ij,Sj\i 



(36) 



On the other hand, 

P(jf) = ¥(p t] < 8\i e 5,-) 

mm -=rr — 

SQV\{N- 3 U{i,j}} Wij^s 
\S\<d 



< 5 



u 



U, 



ij,S 



S:SCV\{JV J -U{i,3}} Wij " S 
|S|<d 



< 



E 



|S|<d 



< 6 



< 5 



E 



S:S£V\{^U{i,j}} 
\S\<d 



Ujj,S U ij,S 



< 8— 



(40) 



According to d36b , we have 8 — u $' s < S — p min < 0. Thus, 



U, 



ij,S 



W ij>s < s 



<<y- 



< exp 



exp 



< exp 



fc((5rf-5)( W «, 5 +^)-^^) 2 



/ \2 



(41) 



fc((Pmin - ~ SS'J 2 



exp 



fc(4 



/ ^2 



where S' w is parameter satisfying (p m ; n — 5)w° — SS' W > 0, and 
(|4TT > follows from Lemma [3] Plugging in (1401 1. we have 



cxp 



exp 



(42) 



Let S = H^t^, A = ^lzM, and ^ = ^ = Ag. 
Combining (f3Tb . d39b and d42l , we have 

P(V° U ( U (w& 4) U ( U (iJ)e£H J"//)) 

<P(7°)+ £ F (-^)+ E p (-^) 

(i,j)££H {i,j)e£ H 

< n 2 exp{-2e 2 fc} + n s+2 ■ 2 cxp | - -^j 

< 7/2 + 7/2 = 7. 

The last inequality holds when 

(If 1 2\ 2 / 4 

fc > max j — (logn + - log ; - J , — ( (s + 2) logn + log - 
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